SEQUENCE LISTING



(1) GENERAL INFORMATION 



(i) APPLICANT: SEED, BRIAN

HAAS, JURGEN

(ii) TITLE OF THE INVENTION: HIGH LEVEL EXPRESSION OF PROTEINS

(iii) NUMBER OF SEQUENCES: 110

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Clark & Elbing LLP

(B) STREET: 176 Federal Street

(C) CITY: Boston

(D) STATE: MA

(E) COUNTRY: USA

(F) ZIP: 02110

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Diskette

(B) COMPUTER: IBM Compatible

(C) OPERATING SYSTEM: DOS

(D) SOFTWARE: FastSEQ for Windows Version 2.0

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 08/717,294

(B) FILING DATE: 20-SEP-1996

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Elbing, Karen L

(B) REGISTRATION NUMBER: 35,238

(C) REFERENCE/DOCKET NUMBER: 00786/345001

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 617-428-0200

(B) TELEFAX: 617-428-7045

(C) TELEX:

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CGCGGGCTAG CCACCGAGAA GCTG 24



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 






(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

ACCGAGAAGC TGTGGGTGAC CGTGTACTAC GGCGTGCCCG TGTGGAAGAG AGGCCACCAC 60

CACCCTGTTC TGCGCCAGCG ACGCCAAGGC GTACGACACC GAGGTGCACA ACGTGTGGGC 120

CACCCAGGCG TGCGTGCCCA CCGACCCCAA CCCCCAGGAG GTGGAGCTCG TGAACGTGAC 180 

CGAGAACTTC AACAT 195 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
CCACCATGTT GTTCTTCCAC ATGTTGAAGT TCTC 34

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
GACCGAGAAC TTCAACATGT GGAAGAACAA CAT 33

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

TGGAAGAACA ACATGGTGGA GCAGATGCAT GAGGACATCA TCAGCCTGTG GGACCAGAGC 60 

CTGAAGCCCT GCGTGAAGCT GACCCCCTGT GCGTGACCTG AACTGCACCG ACCTGAGGAA 120

CACCACCAAC ACCAACACAG CACCGCCAAC AACAACAGCA ACAGCGAGGG CACCATCAAG 180 

GGCGGCGAGA TG 192 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 




(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GTTGAAGCTG CAGTTCTTCA TCTCGCCGCC CTT 33 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GAAGAACTGC AGCTTCAACA TCACCACCAG C 31 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

AACATCACCA CCAGCATCCG CGACAAGATG CAGAAGGAGT ACGCCCTGCT GTACAAGCTG 60 

GATATCGTGA GCATCGACAA CGACAGCACC AGCTACCGCC TGATCTCCTG CAACACCAGC 120

GTGATCACCC AGGCCTGCCC CAAGATCAGC TTCGAGCCCA TCCCCATCCA CTACTGCGCC 180 

CCCGCCGGCT TCGCC 195 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GAACTTCTTG TCGGCGGCGA AGCCGGCGGG 30

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



GCGCCCCCGC CGGCTTCGCC ATCCTGAAGT GCAACGACAA GAAGTTC 47

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GCCGACAAGA AGTTCAGCGG CAAGGGCAGC TGCAAGAACG TGAGCACCGT GCAGTGCACC 60 

CACGGCATCC GGCCGGTGGT GAGCACCCAG CTCCTGCTGA ACGGCAGCCT GGCCGAGGAG 120 

GAGGTGGTGA TCCGCAGCGA GAACTTCACC GACAACGCCA AGACCATCAT CGTGCACCTG 180 

AATGAGAGCG TGCAGATC 198 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
AGTTGGGACG CGTGCAGTTG ATCTGCACGC TCTC 34 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
GAGAGCGTGC AGATCAACTG CACGCGTCCC 30

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

AACTGCACGC GTCCCAACTA CAACAAGCGC AAGCGCATCC ACATCGGCCC CGGGCGCGCC 60 
TTCTACACCA CCAAGAACAT CATCGGCACC ATCCTCCAGG CCCACTGCAA CATCTCTAGA 120



(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTCGTTCCAC TTGGCTCTAG AGATGTTGCA 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GCAACATCTC TAGAGCCAAG TGGAACGAC 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

GCCAAGTGGA ACGACACCCT GCGCCAGATC GTGAGCAAGC TGAAGGAGCA GTTCAAGAAC 
AAGACCATCG TGTTCACCAG AGCAGCGGCG GCGACCCCGA GATCGTGATG CACAGCTTCA 
ACTGCGGCGG C 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCAGTAGAAG AATTCGCCGC CGCAGTTGA 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Other 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TCAACTGCGG CGGCGAATTC TTCTACTGC 29

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GGCGAATTCT TCTACTGCAA CACCAGCCCC CTGTTCAACA GCACCTGGAA CGGCAACAAC 60 

ACCTGGAACA ACACCACCGG CAGCAACAAC AATATTACCC TCCAGTGCAA GATCAAGCAG 120

ATCATCAACA TGTGGCAGGA GGTGGGCAAG GCCATGTACG CCCCCCCCAT CGAGGGCCAG 180 

ATCCGGTGCA GCAGC 195 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCAGACCGGT GATGTTGCTG CTGCACCGGA TCTGGCCCTC 40

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CGAGGGCCAG ATCCGGTGCA GCAGCAACAT CACCGGTCTG 40 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

AACATCACCG GTCTGCTGCT GACCCGCGAC GGCGGCAAGG ACACCGACAC CAACGACACC 60 

GAAATCTTCC GCCCCGGCGG CGGCGACATG CGCGACAACT GGAGATCTGA GCTGTACAAG 120

TACAAGGTGG TGACGATCGA GCCCCTGGGC GTGGCCCCCA CCAAGGCCAA GCGCCGCGTG 180 

GTGCAGCGCG AGAAGCGC 198 



(2) INFORMATION FOR SEQ ID NO:24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CGCGGGCGGC CGCTTTAGCG CTTCTCGCGC TGCACCAC 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CGCGGGGGAT CCAAGCTTAC CATGATTCCA GTAATAAGT 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ATGAATCCAG TAATAAGTAT AACATTATTA TTAAGTGTAT TACAAATGAG TAGAGGACAA
AGAGTAATAA GTTTAACAGC ATCTTTAGTA AATCAAAATT TGAGATTAGA TTGTAGACAT 
GAAAATAATA CAAATTTGCC AATACAACAT GAATTTTCAT TAACG 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CGCGGGGAAT TCACGCGTTA ATGAAAATTC ATGTTG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CGCGGATCCA CGCGTGAAAA AAAAAAACAT 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CGTGAAAAAA AAAAACATGT ATTAAGTGGA ACATTAGGAG TACCAGAACA TACATATAGA 
AGTAGAGTAA ATTTGTTTAG TGATAGATTC ATAAAAGTAT TAACATTAGC AAATTTTACA 
ACAAAAGATG AAGGAGATTA TATGTGTGAG 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CGCGAATTCG AGCTCACACA TATAATCTCC 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CGCGGATCCG AGCTCAGAGT AAGTGGACAA

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
CTCAGAGTAA GTGGACAAAA TCCAACAAGT AGTAATAAAA CAATAAATGT AATAAGAGAT 60
AAATTAGTAA AATGTGAGGA ATAAGTTTAT TAGTACAAAA TACAAGTTGG TTATTATTAT 120
TATTATTAAG TTTAAGTTTT TTACAAGCAA CAGATTTTAT AAGTTTATGA 170

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
CGCGAATTCG CGGCCGCTTC ATAAACTTAT AAAATC 36

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1632 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CTCGAGATCC ATTGTGCTCT AAAGGAGATA CCCGGCCAGA CACCCTCACC TGCGGTGCCC 60
AGCTGCCCAG GCTGAGGCAA GAGAAGGCCA GAAACCATGC CCATGGGGTC TCTGCAACCG 120
CTGGCCACCT TGTACCTGCT GGGGATGCTG GTCGCTTCCG TGCTAGCCAC CGAGAAGCTG 180
TGGGTGACCG TGTACTACGG CGTGCCCGTG TGGAAGGAGG CCACCACCAC CCTGTTCTGC 240
GCCAGCGACG CCAAGGCGTA CGACACCGAG GTGCACAACG TGTGGGCCAC CCAGGCGTGC 300
GTGCCCACCG ACCCCAACCC CCAGGAGGTG GAGCTCGTGA ACGTGACCGA GAACTTCAAC 360
ATGTGGAAGA ACAACATGGT GGAGCAGATG CATGAGGACA TCATCAGCCT GTGGGACCAG 420
AGCCTGAAGC CCTGCGTGAA GCTGACCCCC CTGTGCGTGA CCCTGAACTG CACCGACCTG 480
AGGAACACCA CCAACACCAA CAACAGCACC GCCAACAACA ACAGCAACAG CGAGGGCACC 540
ATCAAGGGCG GCGAGATGAA CAACTGCAGC TTCAACATCA CCACCAGCAT CCGCGACAAG 600
ATGCAGAAGG AGTACGCCCT GCTGTACAAG CTGGATATCG TGAGCATCGA CAACGACAGC 660
ACCAGCTACC GCCTGATCTC CTGCAACACC AGCGTGATCA CCCAGGCCTG GCCCAAGATC 720
AGCTTCGAGC CCATCCCCAT CCACTACTGC GCCCCCGCCG GCTTCGCCAT CCTGAAGTGC 780
AACGACAAGA AGTTCAGCGG CAAGGGCAGC TGCAAGAACG TGAGCACCGT GCAGTGCACC 840
CACGGCATCC GGCCGGTGGT GAGCACCCAG CTCCTGCTGA ACGGCAGCCT GGCCGAGGAG 900
GAGGTGGTGA TCCGCAGCGA GAACTTCACC GACAACGCCA AGACCATCAT CGTGCACCTG 960
AATGAGAGCG TGCAGATCAA CTGCACGCGT CCCAACTACA ACAAGCGCAA GCGCATCCAC 1020
ATCGGCCCCG GGCGCGCCTT CTACACCACC AAGAACATCA TCGGCACCAT CCGCCAGGCC 1080
CACTGCAACA TCTCTAGAGC CAAGTGGAAC GACACCCTGC GCCAGATCGT GAGCAAGCTG 1140
AAGGAGCAGT TCAAGAACAA GACCATCGTG TTCAACCAGA GCAGCGGCGG CGACCCCGAG 1200
ATCGTGATGC ACAGCTTCAA CTGCGGCGGC GAATTCTTCT ACTGCAACAC CAGCCCCCTG 1260
TTCAACAGCA CCTGGAACGG CAACAACACC TGGAACAACA CCACCGGCAG CAACAACAAT 1320
ATTACCCTCC AGTGCAAGAT CAAGCAGATC ATCAACATGT GGCAGGAGGT GGGCAAGGCC 1380
ATGTACGCCC CCCCCATCGA GGGCCAGATC CGGTGCAGCA GCAACATCAC CGGTCTGCTG 1440
CTGACCCGCG ACGGCGGCAA GGACACCGAC ACCAACGACA CCGAAATCTT CCGCCCCGGC 1500
GGCGGCGACA TGCGCGACAA CTGGAGATCT GAGCTGTACA AGTACAAGGT GGTGACGATC 1560
GAGCCCCTGG GCGTGGCCCC CACCAAGGCC AAGCGCCGCG TGGTGCAGCG CGAGAAGCGC 1620
TAAAGCGGCC GC 1632

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:



ACCGAGAAGC TGTGGGTGAC CGTGTACTAC GGCGTGCCCG TGTGGAAGGA GGCCACCACC 
ACCCTGTTCT GCGCCAGCGA CGCCAAGGCG TACGACACCG AGGTGCACAA CGTGTGGGCC 
ACCCAGGCGT GCGTGCCCAC CGACCCCAAC CCCCAGGAGG TGGAGCTCGT GAACGTGACC 
GAGAACTTCA ACATGTGGAA GAACAACATG CTGGAGCAGA TGCATGAGGA CATCATCAGC 
CTGTGGGACC AGAGCCTGAA GCCCTGCGTG AAGCTGACCC CCCTGTGCGT GACCCTGAAC 
TGCACCGACC TGAGGAACAC CACCAACACC AACAACAGCA CCGCCAACAA CAACAGCAAC 
AGCGAGGGCA CCATCAAGGG CGGCGAGATG AAGAACTGCA GCTTCAACAT CACCACCAGC 
ATCCGCGACA AGATGCAGAA GGAGTACGCC CTGCTGTACA AGCTGGATAT CGTGAGCATC 
CACAACGACA GCACCAGCTA CCGCCTGATC TCCTGCAACA CCAGCGTGAT CACCCAGGCC 
TGCCCCAAGA TCAGCTTCGA GCCCATCCCC ATCCACTACT GCGCCCCCGC CGGCTTCGCC 
ATCCTGAAGT GCAACGACAA GAAGTTCAGC GGCAAGGGCA GCTGCAAGAA CGTGACCACC 
GTGCAGTGCA CCCACGGCAT CCGGCCGGTG GTGAGCACCC AGCTCCTGCT GAACGGCAGC 
CTGGCCGAGG AGGAGGTGGT GATCCGCAGC GAGAACTTCA CCGACAACGC CAAGACCATC 
ATCGTGCACC TGAATGAGAG CGTGCAGATC AACTGCACGC GTCCCAACTA CAACAAGCGC 
AAGCGCATCC ACATCGGCCC CGGGCGCGCC TTCTACACCA CCAAGAACAT CATCGGCACC 
ATCCGCCAGG CCCACTGCAA CATCTCTAGA GCCAAGTGGA ACGACACCCT GCGCCAGATC
GTGAGCAAGC TGAAGGAGCA GTTCAAGAAC AAGACCATCG TGTTCAACCA GAGCAGCGGC 
GGCGACCCCG AGATCGTGAT GCACAGCTTC AACTGCGGCG GCGAATTCTT CTACTGCAAC 
ACCAGCCCCC TGTTCAACAG CACCTGGAAC GGCAACAACA CCTGGAACAA CACCACCGGC 
AGCAACAACA ATATTACCCT CCAGTGCAAG ATCAAGCAGA TCATCAACAT GTGGCAGGAG 
GTGGGCAAGG CCATGTACGC CCCCCCCATC GAGGGCCAGA TCCGGTGCAG CAGCAACATC 
ACCGGTCTGC TGCTGACCCG CGACGGCGGC AAGGACACCG ACACCAACGA CACCGAAATC 
TTCCGCCCCG GCGGCGGCGA CATGCGCGAC AACTGGAGAT CTGAGCTGTA CAAGTACAAG 
GTGGTGACGA TCGAGCCCCT GGGCGTGGCC CCCACCAAGG CCAAGCGCCG CGTGGTGCAG 
CGCGAGAAGC GGGCCGCCAT CGGCGCCCTG TTCCTGGGCT TCCTGGGGGC GGCGGGCAGC 
ACCATGGGGG CCGCCAGCGT GACCCTGACC GTGCAGGCCC GCCTGCTCCT GAGCGGCATC 
GTGCAGCAGC AGAACAACCT CCTCCGCGCC ATCGAGGCCC AGCAGCATAT GCTCCAGCTC 
ACCGTGTGGG GCATCAAGCA GCTCCAGGCC CGCGTGCTGG CCGTGGAGCG CTACCTGAAG 
GACCAGCAGC TCCTGGGCTT CTGGGGCTGC TCCGGCAAGC TGATCTGCAC CACCACGGTA 
CCCTGGAACG CCTCCTGGAG CAACAAGAGC CTGGACGACA TCTGGAACAA CATGACCTGG 
ATGCAGTGGG AGCGCGAGAT CGATAACTAC ACCAGCCTGA TCTACAGCCT GCTGGAGAAG 
AGCCAGACCC AGCAGGAGAA GAACGAGCAG GAGCTGCTGG AGCTGGACAA CTGGGCGAGC 
CTGTGGAACT GGTTCGACAT CACCAACTGG CTGTGGTACA TCAAAATCTT CATCATGATT 
GTGGGCGGCC TGGTGGGCCT CCGCATCGTG TTCGCCGTGC TGAGCATCGT GAACCGCGTG 
CGCCAGGGCT ACAGCCCCCT GAGCCTCCAG ACCCGGCCCC CCGTGCCGCG CGGGCCCGAC 
CGCCCCGAGG GCATCGAGGA GGAGGGCGGC GAGCGCGACC GCGACACCAG CGGCAGGCTC 
GTGCACGGCT TCCTGGCGAT CATCTGGGTC GACCTCCGCA GCCTGTTCCT GTTCAGCTAC 
CACCACCGCG ACCTGCTGCT GATCGCCGCC CGCATCGTGG AACTCCTAGG CCGCCGCGGC 
TGGGAGGTGC TGAAGTACTG GTGGAACCTC CTCCAGTATT GGAGCCAGGA GCTGAAGTCC 
AGCGCCGTGA GCCTGCTGAA CGCCACCGCC ATCGCCGTGG CCGAGGGCAC CGACCGCGTG 
ATCGAGGTGC TCCAGAGGGC CGGGAGGGCG ATCCTGCACA TCCCCACCCG CATCCGCCAG 
GGGCTCGAGA GGGCGCTGCT G 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



ATGAATCCAG TAATAAGTAT AACATTATTA TTAAGTGTAT TACAAATGAG TAGAGGACAA 60
AGAGTAATAA GTTTAACAGC ATGTTTAGTA AATCAAAATT TGAGATTAGA TTGTAGACAT 120
GAAAATAATA CACCTTTGCC AATACAACAT GAATTTTCAT TAACGCGTGA AAAAAAAAAA 180
CATGTATTAA GTGGAACATT AGGAGTACCA GAACATACAT ATAGAAGTAG AGTAAATTTG 240
TTTAGTGATA GATTCATAAA AGTATTAACA TTAGCAAATT TTACAACAAA AGATGAAGGA 300
GATTATATGT GTGAGCTCAG AGTAAGTGGA CAAAATCCAA CAAGTAGTAA TAAAACAATA 360
AATGTAATAA GAGATAAATT AGTAAAATGT GGAGGAATAA GTTTATTAGT ACAAAATACA 420
AGTTGGTTAT TATTATTATT ATTAAGTTTA AGTTTTTTAC AAGCAACAGA TTTTATAAGT 480
TTATGA 486

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

ATGAACCCAG TCATCAGCAT CACTCTCCTG CTTTCAGTCT TGCAGATGTC CCGAGGACAG 60
AGGGTGATCA GCCTGACAGC CTGCCTGGTG AACAGAACCT TCGACTGGAC TGCCGTCATG 120
AGAATAACAC CAACTTGCCC ATCCAGCATG AGTTCAGCCT GACCCGAGAG AAGAAGAAGC 180
ACGTGCTGTC AGGCACCCTG GGGGTTCCCG AGCACACTTA CCGCTCCCGC GTCAACCTTT 240
TCAGTGACCG CTTTATCAAG GTCCTTACTC TAGCCAACTT GACCACCAAG GATGAGGGCG 300
ACTACATGTG TGAACTTCGA GTCTCGGGCC AGAATCCCAC AAGCTCCAAT AAAACTATCA 360
ATGTGATCAG AGACAAGCTG GTCAAGTGTG GTGGCATAAG CCTGCTGGTT CAAAACACTT 420
CCTGGCTGCT GCTGCTCCTG CTTTCCCTCT CCTTCCTCCA AGCCACGGAC TTCATTTCTC 480
TGTGA 485

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CGCGGGGCTA GCGCAAAGAG TAATAAGTTT AAC 33

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CGCGGATCCC TTGTATTTTG TACTAATA 28

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Other 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

GAATTCACGC GTAAGCTTGC CGCCACCATG GTGAGCAAGG GCGAGGAGCT GTTCACCGGG 60
GTGGTGCCCA TCCTGGTCGA GCTGGACGGC GACGTGAACG GCCACAAGTT CAGCGTGTCC 120
GGCGAGGGCG AGGGCGATGC CACCTACGGC AAGCTGACCC TGAAGTTCAT CTGCACCACC 180
GGCAAGCTGC CCGTGCCCTG GCCCACCCTC GTGACCACCT TCAGCTACGG CGTGCAGTGC 240
TTCAGCCGCT ACCCCGACCA CATGAAGCAG CACGACTTCT TCAAGTCCGC CATGCCCGAA 300
GGCTACGTCC AGGAGCGCAC CATCTTCTTC AAGGACGACG GCAACTACAA GACCCGCGCC 360
GAGGTGAAGT TCGAGGGCGA CACCCTGGTG AACCGCATCG AGCTGAAGGG CATCGACTTC 420
AAGGAGGACG GCAACATCCT GGGGCACAAG CTGGAGTACA ACTACAACAG CCACAACGTC 480
TATATCATGG CCGACAAGCA GAAGAACGGC ATCAAGGTGA ACTTCAAGAT CCGCCACAAC 540
ATCGAGGACG GCAGCGTGCA GCTCGCCGAC CACTACCAGC AGAACACCCC CATCGGCGAC 600
GGCCCCGTGC TGCTGCCCGA CAACCACTAC CTGAGCACCC AGTCCGCCCT GAGCAAAGAC 660
CCCAACGAGA AGCGCGATCA CATGGTCCTG CTGGAGTTCG TGACCGCCGC CGGGATCACT 720
CACGGCATGG ACGAGCTGTA CAAGTAAAGC GGCCGCGGAT CC 762

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

AAGCTTAAAC CATGCCCATG GGGTCTCTGC AACCGCTGGC CACCTTGTAC CTGCTGGGGA 60
TGCTGGTCGC TTCCGTGCTA GCCGCCACCA GAAGATACTA CCTGGGTGCA GTGGAACTGT 120
CATGGGACTA TATGCAAAGT GATCTCGGTG AGCTGCCTGT GGACGCAAGA TTTCCTCCTA 180
GAGTGCCAAA ATCTTTTCCA TTCAACACCT CAGTCGTGTA CAAAAAGACT CTGTTTGTAG 240
AATTCACGGA TCACCTTTTC AACATCGCTA AGCCAAGGCC ACCCTGGATG GGTCTGCTAG 300
GTCCTACCAT CCAGGCTGAG GTTTATGATA CAGTGGTCAT TACACTTAAG AACATGGCTT 360
CCCATCCTGT CAGTCTTCAT GCTGTTGGTG TATCCTACTG GAAAGCTTCT GAGGGAGCTG 420
AATATGATGA TCAGACCAGT CAAAGGGAGA AAGAAGATGA TAAAGTCTTC CCTGGTGGAA 480
GCCATACATA TGTCTGGCAG GTCCTGAAAG AGAATGGTCC AATGGCCTCT GACCCACTGT 540
GCCTTACCTA CTCATATCTT TCTCATGTGG ACCTGGTAAA AGACTTGAAT TCAGGCCTCA 600
TTGGAGCCCT ACTAGTATGT AGAGAAGGGA GTCTGGCCAA GGAAAAGACA CAGACCTTGC 660
ACAAATTTAT ACTACTTTTT GCTGTATTTG ATGAAGGGAA AAGTTGGCAC TCAGAAACAA 720
AGAACTCCTT GATGCAGGAT AGGGATGCTG CATCTGCTCG GGCCTGGCCT AAAATGCACA 780
CAGTCAATGG TTATGTAAAC AGGTCTCTGC CAGGTCTGAT TGGATGCCAC AGGAAATCAG 840
TCTATTGGCA TGTGATTGGA ATGGGCACCA CTCCTGAAGT GCACTCAATA TTCCTCGAAG 900
GTCACACATT TCTTGTGAGG AACCATCGCC AGGCGTCCTT GGAAATCTCG CCAATAACTT 960
TCCTTACTGC TCAAACACTC TTGATGGACC TTGGACAGTT TCTACTGTTT TGTCATATCT 1020
CTTCCCACCA ACATGATGGC ATGGAAGCTT ATGTCAAAGT AGACAGCTGT CCAGAGGAAC 1080
CCCAACTACG AATGAAAAAT AATGAAGAAG CGGAAGACTA TGATGATGAT CTTACTGATT 1140
CTGAAATGGA TGTGGTCAGG TTTGATGATG ACAACTCTCC TTCCTTTATC CAAATTCGCT 1200
CAGTTGCCAA GAAGCATCCT AAAACTTGGG TACATTACAT TGCTGCTGAA GAGGAGGACT 1260
GGGACTATGC TCCCTTAGTC CTCGCCCCCG ATGACAGAAG TTATAAAAGT CAATATTTGA 1320
ACAATGGCCC TCAGCGGATT GGTAGGAAGT ACAAAAAAGT CCGATTTATG GCATACACAG 1380
ATGAAACCTT TAAGACTCGT GAAGCTATTC AGCATGAATC AGGAATCTTG GGACCTTTAC 1440
TTTATGGGGA AGTTGGAGAC ACACTGTTGA TTATATTTAA GAATCAAGCA AGCAGACCAT 1500
ATAACATCTA CCCTCACGGA ATCACTGATG TCCGTCCTTT GTATTCAAGG AGATTACCAA 1560
AAGGTGTAAA ACATTTGAAG GATTTTCCAA TTCTGCCAGG AGAAATATTC AAATATAAAT 1620
GGACAGTGAC TGTAGAAGAT GGGCCAACTA AATCAGATCC TCGGTGCCTG ACCCGCTATT 1680
ACTCTAGTTT CGTTAATATG GAGAGAGATC TAGCTTCAGG ACTCATTGGC CCTCTCCTCA 1740
TCTGCTACAA AGAATCTGTA GATCAAAGAG GAAACCAGAT AATGTCAGAC AAGAGGAATG 1800
TCATCCTGTT TTCTGTATTT GATGAGAACC GAAGCTGGTA CCTCACAGAG AATATACAAC 1860
GCTTTCTCCC CAATCCAGCT GGAGTGCAGC TTGAGGATCC AGAGTTCCAA GCCTCCAACA 1920
TCATGCACAG CATCAATGGC TATGTTTTTG ATAGTTTGCA GTTGTCAGTT TGTTTGCATG 1980
AGGTGGCATA CTGGTACATT CTAAGCATTG GAGCACAGAC TGACTTCCTT TCTGTCTTCT 2040
TCTCTGGATA TACCTTCAAA CACAAAATGG TCTATGAAGA CACACTCACC CTATTCCCAT 2100
TCTCAGGAGA AACTGTCTTC ATGTCGATGG AAAACCCAGG TCTATGGATT CTGGGGTGCC 2160
ACAACTCAGA CTTTCGGAAC AGAGGCATGA CCGCCTTACT GAAGGTTTCT AGTTGTGACA 2220
AGAACACTGG TGATTATTAC GAGGACAGTT ATGAAGATAT TTCAGCATAC TTGCTGAGTA 2280
AAAACAATGC CATTGAACCA AGAAGCTTCT CCCAGAATTC AAGACACCCT AGCACTAGGC 2340
AAAAGCAATT TAATGCCACC CCACCAGTCT TGAAACGCCA TCAACGGGAA ATAACTCGTA 2400
CTACTCTTCA GTCAGATCAA GAGGAAATTG ACTATGATGA TACCATATCA GTTGAAATGA 2460
AGAAGGAAGA TTTTGACATT TATGATGAGG ATGAAAATCA GAGCCCCCGC AGCTTTCAAA 2520
AGAAAACACG ACACTATTTT ATTGCTGCAG TGGAGAGGCT CTGGGATTAT GGGATGAGTA 2580
GCTCCCCACA TGTTCTAAGA AACAGGGCTC AGAGTGGCAG TGTCCCTCAG TTCAAGAAAG 2640
TTGTTTTCCA GGAATTTACT GATGGCTCCT TTACTCAGCC CTTATACCGT GGAGAACTAA 2700
ATGAACATTT GGGACTCCTG GGGCCATATA TAAGAGCAGA AGTTGAAGAT AATATCATGG 2760
TAACTTTCAG AAATCAGGCC TCTCGTCCCT ATTCCTTCTA TTCTAGCCTT ATTTCTTATG 2820
AGGAAGATCA GAGGCAAGGA GCAGAACCTA GAAAAAACTT TGTCAAGCCT AATGAAACCA 2880
AAACTTACTT TTGGAAAGTG CAACATCATA TGGCACCCAC TAAAGATGAG TTTGACTGCA 2940
AAGCCTGGGC TTATTTCTCT GATGTTGACC TGGAAAAAGA TGTGCACTCA GGCCTGATTG 3000
GACCCCTTCT GGTCTGCCAC ACTAACACAC TGAACCCTGC TCATGGGAGA CAAGTGACAG 3060
TACAGGAATT TGCTCTGTTT TTCACCATCT TTGATGAGAC CAAAAGCTGG TACTTCACTG 3120
AAAATATGGA AAGAAACTGC AGGGCTCCCT GCAATATCCA GATGGAAGAT CCCACTTTTA 3180
AAGAGAATTA TCGCTTCCAT GCAATCAATG GCTACATAAT GGATACACTA CCTGGCTTAG 3240
TAATGGCTCA GGATCAAAGG ATTCGATGGT ATCTGCTCAG CATGGGCAGC AATGAAAACA 3300
TCCATTCTAT TCATTTCAGT GGACATGTGT TCACTGTACG AAAAAAAGAG GAGTATAAAA 3360
TGGCACTGTA CAATCTCTAT CCAGGTGTTT TTGAGACAGT GGAAATGTTA CCATCCAAAG 3420
CTGGAATTTG GCGGGTGGAA TGCCTTATTG GCGAGCATCT ACATGCTGGG ATGAGCACAC 3480
TTTTTCTGGT GTACAGCAAT AAGTGTCAGA CTCCCCTGGG AATGGCTTCT GGACACATTA 3540
GAGATTTTCA GATTACAGCT TCAGGACAAT ATGGACAGTG GGCCCCAAAG CTGGCCAGAC 3600
TTCATTATTC CGGATCAATC AATGCCTGGA GCACCAAGGA GCCCTTTTCT TGGATCAAGG 3660
TGGATCTGTT GGCACCAATG ATTATTCACG GCATCAAGAC CCAGGGTGCC CGTCAGAAGT 3720
TCTCCAGCCT CTACATCTCT CAGTTTATCA TCATGTATAG TCTTGATGGG AAGAAGTGGC 3780
AGACTTATCG AGGAAATTCC ACTGGAACCT TAATGGTCTT CTTTGGCAAT GTGGATTCAT 3840
CTGGGATAAA ACACAATATT TTTAACCCTC CAATTATTGC TCGATACATC CGTTTGCACC 3900
CAACTCATTA TAGCATTCGC AGCACTCTTC GCATGGAGTT GATGGGCTGT GATTTAAATA 3960
GTTGCAGCAT GCCATTGGGA ATGGAGAGTA AAGCAATATC AGATGCACAG ATTACTGCTT 4020
CATCCTACTT TACCAATATG TTTGCCACCT GGTCTCCTTC AAAAGCTCGA CTTCACCTCC 4080
AAGGGAGGAG TAATGCCTGG AGACCTCAGG TGAATAATCC AAAAGAGTGG CTGCAAGTGG 4140
ACTTCCAGAA GACAATGAAA GTCACAGGAG TAACTACTCA GGGAGTAAAA TCTCTGCTTA 4200
CCAGCATGTA TGTGAAGGAG TTCCTCATCT CCAGCAGTCA AGATGGCCAT CAGTGGACTC 4260
TCTTTTTTCA GAATGGCAAA GTAAAGGTTT TTCAGGGAAA TCAAGACTCC TTCACACCTG 4320
TGGTGAACTC TCTAGACCCA CCGTTACTGA CTCGCTACCT TCGAATTCAC CCCCAGAGTT 4380
GGGTGCACCA GATTGCCCTG AGGATGGAGG TTCTGGGCTG CGAGGCACAG GACCTCTACT 4440
GAGGGTGGCC ACTGCAGCAC CTGCCACTGC CGTCACCTCT CCCTCCTCAG CTCCAGGGCA 4500
GTGTCCCTCC CTGGCTTGCC TTCTACCTTT GTGCTAAATC CTAGCAGACA CTGCCTTGAA 4560
GCCTCCTGAA TTAACTATCA TCAGTCCTGC ATTTCTTTGG TGGGGGGCCA GGAGGGTGCA 4620
TCCAATTTAA CTTAACTCTT ACCGTCGACC TGCAGGCCCA ACGCGGCCGC 4670

(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

AAGCTTAAAC CATGCCCATG GGGTCTCTGC AACCGCTGGC CACCTTGTAC CTGCTGGGGA 60
TGCTGGTCGC TTCCGTGCTA GCCGCCACCC GCCGCTACTA CCTGGGCGCC GTGGAGCTGT 120
CCTGGGACTA CATGCAGAGC GACCTGGGCG AGCTCCCCGT GGACGCCCGC TTCCCCCCCC 180
GCGTGCCCAA GAGCTTCCCC TTCAACACCA GCGTGGTGTA CAAGAAAACC CTGTTCGTGG 240
AGTTCACCGA CCACCTGTTC AACATTGCCA AGCCGCGCCC CCCCTGGATG GGCCTGCTGG 300
GCCCCACCAT CCAGGCCGAG GTGTACGACA CCGTGGTGAT CACCCTGAAG AACATGGCCA 360
GCCACCCCGT CAGCCTGCAC GCCGTGGGCG TGAGCTACTG GAAGGCCAGC GAGGGCGCCG 420
AGTACGACGA CCAGACGTCC CAGCGCGAGA AGGAGGACGA CAAGGTGTTC CCGGGGGGGA 480
GCCACACCTA CGTGTGGCAG GTGCTTAAGG AGAACGGCCC TATGGCCAGC GACCCCCTGT 540
GCCTGACCTA CAGCTACCTG AGCCACGTGG ACCTGGTGAA GGATCTGAAC AGCGGGCTGA 600
TCGGCGCCCT GCTGGTGTGT CGCGAGGGCA GCCTGGCCAA GGAGAAAACC CAGACCCTGC 660
ACAAGTTCAT CCTGCTGTTC GCCGTGTTCG ACGAGGGGAA GAGCTGGCAC AGCGAGACTA 720
AGAACAGCCT GATGCAGGAC CGCGACGCCG CCAGCGCCCG CGCCTGGCCC AAGATGCACA 780
CCGTTAACGG CTACGTGAAC CGCAGCCTGC CCGGCCTGAT CGGCTGCCAC CGCAAGAGCG 840
TGTACTGGCA CGTCATCGGC ATGGGCACCA CCCCTGAGGT GCACAGCATC TTCCTGGAGG 900
GCCACACCTT CCTGGTGCGC AACCACCGCC AGGCCAGCCT GGAGATCAGC CCCATCACCT 960
TCCTGACTGC CCAGACCCTG CTGATGGACC TAGGCCAGTT CCTGCTGTTC TGCCACATCA 1020
GCAGCCACCA GCACGACGGC ATGGAGGCTT ACGTGAAGGT GGACAGCTGC CCCGAGGAGC 1080
CCCAGCTGCG CATGAAGAAC AACGAGGAGG CCGAGGACTA CGACGACGAC CTGACCGACA 1140
GCGAGATGGA TGTCGTACGC TTCGACGACG ACAACAGCCC CAGCTTCATC CAGATCCGCA 1200
GCGTGGCCAA GAAGCACCCT AAGACCTGGG TGCACTACAT CGCCGCCGAG GAGGAGGACT 1260
GGGACTACGC CCCGCTAGTA CTGGCCCCCG ACGACCGCAG CTACAAGAGC CAGTACCTGA 1320
ACAACGGCCC CCAGCGCATC GGCCGCAAGT ACAAGAAGGT GCGCTTCATG GCCTACACCG 1380
ACGAGACTTT CAAGACCCGC GAGGCCATCC AGCACGAGTC CGGCATCCTC GGCCCCCTGC 1440
TGTACGGCGA GGTGGGCGAC ACCCTGCTGA TCATCTTCAA GAACCAGGCC AGCAGGCCCT 1500
ACAACATCTA CCCCCACGGC ATCACCGACG TGCGCCCCCT GTACAGCCGC CGCCTGCCCA 1560
AGGGCGTGAA GCACCTGAAG GACTTCCCCA TCCTGCCCGG CGAGATCTTC AAGTACAAGT 1620
GGACCGTGAC CGTGGAGGAC GGCCCCACCA AGAGCGACCC CCGCTGCCTG ACCCGCTACT 1680
ACAGCAGCTT CGTGAACATG GAGCGCGACC TGGCCTCCGG ACTGATCGGC CCCCTGCTGA 1740
TCTGCTACAA GGAGAGCGTG GACCAGCGCG GCAACCAGAT CATGAGCGAC AAGCGCAACG 1800
TGATCCTGTT CAGCGTGTTC GACGAGAACC GCAGCTGGTA TCTGACCGAG AACATCCAGC 1860
GCTTCCTGCC CAACCCCGCT GGCGTGCAGC TGGAAGATCC CGAGTTCCAG GCCAGCAACA 1920
TCATGCACAG CATCAACGGC TACGTGTTCG ACAGCCTGCA GCTGAGCGTG TGCCTGCATG 1980
AGGTGGCCTA CTGGTACATC CTGAGCATCG GCGCCCAGAC CGACTTCCTG AGCGTGTTCT 2040
TCTCCGGGTA TACCTTCAAG CACAAGATGG TGTACGAGGA CACCCTGACC CTGTTCCCCT 2100
TCTCCGGCGA GACTGTGTTC ATGTCTATGG AGAACCCCGG CCTGTGGATT CTGGGCTGCC 2160
ACAACAGCGA CTTCCGCAAC CGCGGCATGA CTGCCCTGCT GAAAGTCTCC AGCTGCGACA 2220
AGAACACCGG CGACTACTAC GAGGACAGCT ACGAGGACAT CTCCGCCTAC CTGCTGTCCA 2280
AGAACAACGC CATCGAGCCC CGCTCCTTCT CCCAAAACTC CCGCCACCCC AGCACGCGTC 2340
AGAAGCAGTT CAACGCCACC CCCCCCGTGC TGAAGCGCCA CCAGCGCGAG ATCACCCGCA 2400
CCACCCTGCA AAGCGACCAG GAGGAGATCG ACTACGACGA CACCATCAGC GTGGAGATGA 2460
AGAAGGAGGA CTTCGACATC TACGACGAGG ACGAGAACCA GAGCCCCCGC TCCTTCCAAA 2520
AGAAAACCCG CCACTACTTC ATCGCCGCCG TGGAGCGCCT GTGGGACTAC GGCATGAGCA 2580
GCAGCCCCCA CGTCCTGCGC AACCGCGCCC AGAGCGGCAG CGTGCCCCAG TTCAAGAAGG 2640
TGGTGTTCCA GGAGTTCACC GACGGCAGCT TCACCCAGCC CCTGTACCGC GGCGAGCTGA 2700
ACGAGCACCT GGGCCTGCTC GGCCCCTACA TCCGCGCCGA GGTGGAGGAC AACATCATGG 2760
TGACCTTCCG CAACCAAGCC TCCCGGCCCT ACTCCTTCTA CTCCTCCCTG ATCAGCTACG 2820
AGGAGGACCA GCGCCAGGGC GCCGAGCCCC GCAAGAACTT CGTGAAGCCC AACGAGACTA 2880
AGACCTACTT CTGGAAGGTG CAGCACCACA TGGCCCCCAC CAAGGACGAG TTCGACTGCA 2940
AGGCCTGGGC CTACTTCAGC GACGTGGACC TGGAGAAGGA CGTGCACAGC GGCCTGATCG 3000
GCCCCCTGCT GGTGTGCCAC ACCAACACCC TGAACCCCCC CCACGGGAGG CAGGTGACTG 3060
TGCAGGAATT TGCCCTGTTC TTCACCATCT TCGACGAGAC TAAGAGCTGG TACTTCACCG 3120
AGAACATGGA GCGCAACTGC CGCGCCCCCT GCAACATCCA GATGGAAGAT CCCACCTTCA 3180
AGGAGAACTA CCGCTTCCAC GCCATCAACG GCTACATCAT GGACACCCTG CCCGGCCTGG 3240
TGATGGCCCA GGACCAGCGC ATCCGCTGGT ACCTGCTGTC TATGGGCAGC AACGAGAACA 3300
TCCACAGCAT CCACTTCAGC GGCCACGTTT TCACCGTGCG CAAGAAGGAG GAGTACAAGA 3360
TGGCCCTGTA CAACCTGTAC CCCGGCGTGT TCGAGACTGT GGAGATGCTG CCCAGCAAGG 3420
CCGGGATCTG GCGCGTGGAG TGCCTGATCG GCGAGCACCT GCACGCCGGC ATGAGCACCC 3480
TGTTCCTGGT GTACAGCAAC AAGTGCCAGA CCCCCCTGGG CATGGCCAGC GGCCACATCC 3540
GCGACTTCCA GATCACCGCC AGCGGCCAGT ACGGCCAGTG GGCTCCCAAG CTGGCCCGCC 3600
TGCACTACAG CGGCAGCATC AACGCCTGGT CGACCAAGGA GCCCTTCTCC TGGATCAAGG 3660
TGGACCTGCT GGCCCCCATG ATCATCCACG GCATCAAGAC CCAGGGCGCC CGCCAGAAGT 3720
TCAGCAGCCT GTACATCAGC CAGTTCATCA TCATGTACTC TCTAGACGGC AAGAAGTGGC 3780
AGACCTACCG CGGCAACAGC ACCGGCACCC TGATGGTGTT CTTCGGCAAC GTGGACAGCA 3840
GCGGCATCAA GCACAACATC TTCAACCCCC CCATCATCGC CCGCTACATC CGCCTGCACC 3900
CCACCCACTA CAGCATCCGC AGCACCCTGC GCATGGAGCT GATGGGCTGC GACCTGAACA 3960
GCTGCAGCAT GCCCCTGGGC ATGGAGAGCA AGGCCATCAG CGACGCCCAG ATCACCGCCT 4020
CCAGCTACTT CACCAACATG TTCGCCACCT GGAGCCCCAG CAAGGCCCGC CTGCACCTGC 4080
AGGGCCGCAG CAACGCCTGG CGCCCCCAGG TGAACAACCC CAAGGAGTGG CTGCAGGTGG 4140
ACTTCCAGAA AACCATGAAG GTGACTGGCG TGACCACCCA GGGCGTCAAG AGCCTGCTGA 4200
CCAGCATGTA CGTGAAGGAG TTCCTGATCA GCAGCAGCCA GGACGGCCAC CAGTGGACCC 4260
TGTTCTTCCA AAACGGCAAG GTGAAGGTGT TCCAGGGCAA CCAGGACAGC TTCACACCGG 4320
TCGTGAACAG CCTGGACCCC CCCCTGCTGA CCCGCTACCT GCGCATCCAC CCCCAGAGCT 4380
GGGTGCACCA GATCGCCCTG CGCATGGAGG TGCTGGGCTG CGAGGCCCAG GACCTGTACT 4440
GAAGCGGCCG C 4451



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
GGGGATCCTC ACGTCTCA 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
CTGCTTCTGA CGCGTGCTGG GGTGGCGGGA GTT 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 
CTGCTGAAAG TCTCCAGCTG C 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 
GGCAGGTGCT TAAGGAGAAC GGCCCTATGG CCA 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 



